An Investigation of Automatic Phonetic-Unit Selection for Forensic Voice Comparison
نویسندگان
چکیده
A hybrid Hidden Markov Model (HMM) Gaussian Mixture Model (GMM) system was proposed to automatically select tokens of /iau/, /ai/, /ei/, /m/ and /n/ in a database of recordings of Standard-Chinese speech collected under studio-clean, mobile-landline degraded and mismatched recording conditions. The FVC systems constructed were all MFCC GMM-UBM systems, but based on different portions of the recordings. Fusion of an FVC system based on portions of the recordings within manual /iau/ markers with a baseline system based on all speech-active segments of the recordings resulted in a relatively large improvement in validity in all three conditions.
منابع مشابه
Forensic Voice Comparison Using Chinese /iau/
An acoustic-phonetic forensic-voice-comparison system extracted information from the formant trajectories of tokens of Standard Chinese /iau/. When this information was added to a generic automatic forensic-voice-comparison system, which did not itself exploit acoustic-phonetic information, there was a substantial improvement in system validity but a decline in system reliability.
متن کاملForensic Voice Comparisons in German with Phonetic and Automatic Features Using Vocalise Software
In this article, we present a novel forensic speaker recognition system that provides the capability to perform comparisons using both ‘traditional’ forensic phonetic parameters and ‘automatic’ spectral features in a semior fully automatic way. We evaluate this approach with simulated and real forensic case data in German, which ranges from high quality laboratory audio data to real telephone i...
متن کاملThis is an author produced version of Mapping across feature spaces in forensic voice comparison: the contribution of auditory-based voice quality to (semi-)automatic system
In forensic voice comparison, there is increasing focus on the integration of automatic and phonetic methods to improve the validity and reliability of voice evidence to the courts. In line with this, we present a comparison of long-term measures of the speech signal to assess the extent to which they capture complementary speaker-specific information. Likelihood ratiobased testing was conducte...
متن کاملMapping Across Feature Spaces in Forensic Voice Comparison: The Contribution of Auditory-Based Voice Quality to (Semi-)Automatic System Testing
In forensic voice comparison, there is increasing focus on the integration of automatic and phonetic methods to improve the validity and reliability of voice evidence to the courts. In line with this, we present a comparison of long-term measures of the speech signal to assess the extent to which they capture complementary speaker-specific information. Likelihood ratiobased testing was conducte...
متن کاملDatabase selection for forensic voice comparison
Defining the relevant population to sample is an important issue in data-based implementation of the likelihood-ratio framework for forensic voice comparison. We present a logical argument that because an investigator or prosecutor only submits suspect and offender recordings for forensic analysis if they sound sufficiently similar to each other, the appropriate defense hypothesis for the foren...
متن کامل